Created Deep Recurrent Q-Network example by Douglas-Cho · Pull Request #85 · rlcode/reinforcement-learning

Douglas-Cho · 2018-12-25T06:03:09Z

This shows one way to implement a Deep Recurrent Q-Network (DRQN) model for the CartPole case. I had to expand the state input to include a few past states, which creates a meaningful sequential input stream for the Long Short-Term Memory (LSTM) model. With only the current state information, it did not work. This may sound like it violates the Markov property assumption, but it does the job.

Create cartpole-drqn.py

the graph for drqn

saved weights for drqn

Douglas-Cho added 6 commits December 25, 2018 13:18

Merge pull request #1 from Douglas-Cho/Douglas-Cho-drqn-1

25b7598

Create cartpole-drqn.py

the graph for drqn

e9b27c1

Merge pull request #2 from Douglas-Cho/Douglas-Cho-drqn-2

a622919

the graph for drqn

saved weights for drqn

83f7e65

Merge pull request #3 from Douglas-Cho/Douglas-Cho-drqn-3

1e43fe0

saved weights for drqn

Douglas-Cho wants to merge 6 commits into rlcode:master from Douglas-Cho:master
